Algorithms for Nash Equilibria in General-Sum Stochastic Games
نویسندگان
چکیده
Over the past few decades the quest for algorithms to compute Nash equilibria in general-sum stochastic games has intensified and several important algorithms (cf. [9], [12], [16], [7]) have been proposed. However, they suffer from either lack of generality or are intractable for even medium sized problems or both. In this paper, we first formulate a non-linear optimization problem for stochastic games and then break it down into simpler subproblems that ensure there is no Bellman error for a given state and agent. Next, we derive a set of novel necessary and sufficient conditions for solution points of these sub-problems to be Nash equilibria of the underlying game. Using these conditions, we develop two novel algorithms OFF-SGSP and ON-SGSP,respectively. OFF-SGSP is an off-line centralized algorithm which assumes complete information of the game. On the other hand, ON-SGSP is an online decentralized algorithm that works with simulated transitions of the stochastic game. Both algorithms are guaranteed to converge to Nash equilibrium strategies for general-sum (discounted) stochastic games.
منابع مشابه
Two-Timescale Algorithms for Learning Nash Equilibria in General-Sum Stochastic Games
We consider the problem of finding stationary Nash equilibria (NE) in a finite discounted general-sum stochastic game. We first generalize a non-linear optimization problem from [9] to a general N player game setting. Next, we break down the optimization problem into simpler sub-problems that ensure there is no Bellman error for a given state and an agent. We then provide a characterization of ...
متن کاملA Study of Gradient Descent Schemes for General-Sum Stochastic Games
Zero-sum stochastic games are easy to solve as they can be cast as simple Markov decision processes. This is however not the case with general-sum stochastic games. A fairly general optimization problem formulation is available for general-sum stochastic games by Filar and Vrieze [2004]. However, the optimization problem there has a non-linear objective and non-linear constraints with special s...
متن کاملStochastic Learning of Equilibria in Games: The Ordinary Differential Equation Method
Our purpose is to discuss stochastic algorithms to learn equilibria in games, and their time of convergence. To do so, we consider a general class of stochastic algorithms that converge weakly (in the sense of weak convergence for stochastic processes) towards solutions of particular ordinary differential equations, corresponding to their mean-field approximations. Tuning parameters in these al...
متن کاملFast Planning in Stochastic Games
Stochastic games generalize Markov decision processes (MDPs) to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards determined by multiplayer matrix games at each state. We consider the problem of computing Nash equilibria in stochastic games, the analogue of planning in MDPs. We begin by providing a generalization of nite-horizon v...
متن کاملOn Nash Equilibria in Stochastic Games
We study in nite stochastic games played by n-players on a nite graph with goals given by sets of in nite traces. The games are stochastic (each player simultaneously and independently chooses an action at each round, and the next state is determined by a probability distribution depending on the current state and the chosen actions), innite (the game continues for an in nite number of rounds),...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1401.2086 شماره
صفحات -
تاریخ انتشار 2014